Content
- What is Reproducible Research
- A brief introduction to R and RStudio
- Introduction to Quarto
- Building the course webpage by using Quarto
- Doing Data Science with tidyverse
School of Computing and Information Systems,
Singapore Management University
21 Feb 2023
Research is considered to be reproducible when the exact results can be reproduced if given access to the original data, software, or code.
Reproducible research is sometimes known as reproducibility, reproducible statistical analysis, reproducible data analysis, reproducible reporting, and literate programming.
Source: https://www.displayr.com/what-is-reproducible-research/
The “what” that needs to be reproduced is typically:
Tables
Visualizations/figures/graphs
Values reported in the text
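A minimal sketch of what reproducing these three outputs can look like in R. The data are simulated, and the names `run_analysis` and `plot_scores` are hypothetical, introduced only for illustration. Fixing the random seed lets a second run regenerate the same table and the same reported value.

```r
# Hypothetical analysis: simulate scores for two groups, then derive
# a table, a figure, and a value that would be quoted in the text.
run_analysis <- function(seed = 1234) {
  set.seed(seed)  # fix the random number stream so reruns match
  scores <- data.frame(
    group = rep(c("A", "B"), each = 50),
    score = c(rnorm(50, mean = 60, sd = 10),
              rnorm(50, mean = 65, sd = 10))
  )
  # Table: mean score by group
  summary_table <- aggregate(score ~ group, data = scores, FUN = mean)
  # Value reported in the text: difference between the two group means
  mean_diff <- diff(summary_table$score)
  list(data = scores, table = summary_table, mean_diff = mean_diff)
}

# Figure: boxplot of scores by group (drawn only when called)
plot_scores <- function(result) {
  boxplot(score ~ group, data = result$data, main = "Score by group")
}

first_run  <- run_analysis()
second_run <- run_analysis()

# Same code, same seed, same data-generating steps: the outputs agree.
identical(first_run$table, second_run$table)
identical(first_run$mean_diff, second_run$mean_diff)
```

Calling `plot_scores(first_run)` draws the figure from the same data, so the table, the figure, and the reported value all trace back to one script.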
R is a powerful language and environment for statistical computing and graphics. It is a re-implementation of the S language, which was developed at Bell Laboratories in the 1970s.
R is a high-level language. The core language has some superficial similarities to C, but many things are handled automatically in R that are not in C.
It is free and open-source software (FOSS), distributed in source code form under the terms of the Free Software Foundation’s GNU General Public License.
It compiles and runs on a wide variety of UNIX platforms and similar systems (including FreeBSD and Linux), Windows and macOS.
It is available from the Comprehensive R Archive Network (CRAN).
An effective data handling and storage facility,
A suite of operators for calculations on arrays, in particular matrices,
A large, coherent, integrated collection of intermediate tools for data analysis,
Graphical facilities for data analysis and display either on-screen or on hardcopy, and
A well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.
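The facilities listed above can be illustrated with a short base R sketch. The data values and the names `sales` and `factorial_rec` are made up for this example.

```r
# Data handling: a small data frame with made-up values
sales <- data.frame(
  region = c("North", "South", "East"),
  units  = c(120, 95, 143)
)
high_sales <- sales[sales$units > 100, ]  # keep rows with more than 100 units

# Array operators: matrix() fills column by column, so m is
#   1 3
#   2 4
m <- matrix(1:4, nrow = 2)
m_squared <- m %*% m   # matrix product
m_t <- t(m)            # transpose

# Programming language: a conditional inside a user-defined recursive function
factorial_rec <- function(n) {
  if (n <= 1) 1 else n * factorial_rec(n - 1)
}

# A loop that accumulates a running total
running_total <- 0
for (u in sales$units) {
  running_total <- running_total + u
}

# Output facilities: print results to the console
print(high_sales)
print(m_squared)
cat("5! =", factorial_rec(5), "and total units =", running_total, "\n")
```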
It is highly extensible, with thousands of well-documented extensions (called R packages) for a very broad range of application areas such as finance, business, economics and biostatistics (about 18,471 R packages as of August 2022).
It has a vast community in both academia and business, with active forums such as Stack Overflow and RStudio Community.
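Packages are usually installed from CRAN with `install.packages()` and then loaded with `library()`. The sketch below wraps that pattern in a helper; `ensure_package` is a hypothetical name, not part of R, and installation is switched off by default so that the check has no side effects.

```r
# Hypothetical helper: report whether a package is available and,
# only if asked, try to install it from the configured CRAN mirror.
ensure_package <- function(pkg, install_missing = FALSE) {
  if (requireNamespace(pkg, quietly = TRUE)) {
    return(TRUE)                      # already installed
  }
  if (install_missing) {
    install.packages(pkg)             # needs internet access
    return(requireNamespace(pkg, quietly = TRUE))
  }
  FALSE                               # missing and not installed
}

# "stats" ships with R, so this check needs no download.
ensure_package("stats")
```

For a package that is not yet installed, `ensure_package("tidyverse", install_missing = TRUE)` would attempt the download first.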
Download the R installer by visiting one of the following links:
Install R by running the installer. If necessary, grant the installer administrator rights. Install R in the root directory when prompted.
After the installation completes, check your computer's environment variables. If the R path is not defined, update the PATH variable manually.
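One way to carry out this check from inside R is sketched below. It assumes that the relevant directory is the one returned by `R.home("bin")`, and it inspects the PATH as seen by the current R session, which may differ from the system-wide setting.

```r
# Directory that holds the R executables for this installation
r_bin <- normalizePath(R.home("bin"), winslash = "/", mustWork = FALSE)

# Split PATH into its entries using the platform's separator
# (";" on Windows, ":" elsewhere) and normalise them for comparison
path_entries <- strsplit(Sys.getenv("PATH"), .Platform$path.sep,
                         fixed = TRUE)[[1]]
path_entries <- normalizePath(path_entries, winslash = "/", mustWork = FALSE)

# TRUE if the R bin directory is among the PATH entries
r_on_path <- r_bin %in% path_entries

cat("R bin directory:", r_bin, "\n")
cat("Found on PATH:  ", r_on_path, "\n")
```

If the second line prints `FALSE`, add the directory shown in the first line to the PATH variable through the operating system's settings.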
RTools is a toolchain bundle used for building R packages from source (those that need compilation of C/C++ or Fortran code) and for building R itself.
Download RTools from this site.
After the installation completes, check your computer's environment variables to ensure that the RTools path is there.
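A quick way to see whether the build tools are visible to R is to ask where the executables live. This is a minimal sketch that assumes `make` and `gcc` are the tools of interest; an empty result means the tool was not found on the PATH of the current session.

```r
# Sys.which() returns the full path of each executable, or "" if not found
build_tools <- Sys.which(c("make", "gcc"))

# Logical flag per tool, keeping the tool names as labels
tools_found <- setNames(nzchar(build_tools), names(build_tools))

print(build_tools)
print(tools_found)

# If the pkgbuild package is installed, it offers a fuller check:
# pkgbuild::has_build_tools(debug = TRUE)
```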
A method of literate programming that embeds code and output from Python, R, and JavaScript via integration with Jupyter, Knitr, and Observable.
A variety of extensions to Pandoc markdown useful for technical writing including cross-references, sub-figures, layout panels, hoverable citations and footnotes, callouts, and more.
A project system for rendering groups of documents at once, sharing options across documents, and producing aggregate output like websites and books.
Authoring using a wide variety of editors and notebooks including JupyterLab, RStudio, and VS Code.
A visual markdown editor that provides a productive writing interface for composing long-form documents.
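To make these features concrete, the sketch below uses R to write a minimal Quarto document containing a YAML header, an executable R code chunk, and a cross-reference to the figure the chunk produces. The file name `hello.qmd`, its contents, and the `do_render` switch are illustrative assumptions. Rendering is left off by default because it needs the Quarto command-line tool (and, for the wrapper used here, the quarto R package).

```r
# Build the code-fence marker programmatically to avoid nesting fences here
fence <- strrep("`", 3)

qmd_lines <- c(
  "---",
  "title: \"Hello Quarto\"",
  "format: html",
  "---",
  "",
  "## Speed and stopping distance",
  "",
  paste0(fence, "{r}"),                      # start of an executable R chunk
  "#| label: fig-cars",                      # label used for cross-referencing
  "#| fig-cap: \"Speed and stopping distance of cars\"",
  "plot(cars)",
  fence,                                     # end of the chunk
  "",
  "See @fig-cars for the relationship."      # cross-reference to the figure
)

# Write the document to a temporary folder
qmd_file <- file.path(tempdir(), "hello.qmd")
writeLines(qmd_lines, qmd_file)

# Optional: render to HTML when Quarto and the quarto R package are available
do_render <- FALSE
if (do_render && nzchar(Sys.which("quarto")) &&
    requireNamespace("quarto", quietly = TRUE)) {
  quarto::quarto_render(qmd_file)
}
```

The same file can also be rendered from a terminal with `quarto render hello.qmd`.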
Learn more about Quarto at https://quarto.org.
Quarto Guide